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Abstract — The throughput of multicell systems is inherently 
limited by interference and the available communication re- 
sources. Coordinated resource allocation is the key to efficient 
performance, but the demand on backhaul signaling and compu- 
tational resources grows rapidly with number of cells, terminals, 
and subcarriers. To handle this, we propose a novel multicell 
framework with dynamic cooperation clusters where each termi- 
nal is jointly served by a small set of base stations. Each base 
station coordinates interference to neighboring terminals only, 
thus limiting backhaul signalling and making the framework 
scalable. This framework can describe anything from interference 
channels to ideal joint multicell transmission. 

The resource allocation (i.e., precoding and scheduling) is 
formulated as an optimization problem (FT} with performance 
described by arbitrary monotonic functions of the signal-to- 
interference-and-noise ratios (SINRs) and arbitrary linear power 
constraints. Although {FT} is non-convex and difficult to solve 
optimally, we are able to prove: 1) Optimality of single-stream 
beamforming; 2) Conditions for full power usage; and 3) A 
precoding parametrization based on a few parameters between 
zero and one. These optimality properties are used to propose 
low-complexity strategies: both a centralized scheme and a 
distributed version that only requires local channel knowledge 
and processing. We evaluate the performance on measured mul- 
ticell channels and observe that the proposed strategies achieve 
close-to-optimal performance among centralized and distributed 
solutions, respectively. In addition, we show that multicell in- 
terference coordination can give substantial improvements in 
sum performance, but that joint transmission is very sensitive to 
synchronization errors and that some terminals can experience 
performance degradations. 

Index Terms — Channel measurements, dynamic cooperation 
clusters, low-complexity distributed strategies, multicell multi- 
antenna system, optimality properties, resource allocation. 
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I. Introduction 

IN conventional cellular systems, each terminal belongs 
to one cell at a time and data transmission is scheduled 
autonomously by its base station. We consider systems where 
each base station can divide its transmit resources between 
terminals using orthogonal frequency-division multiple access 
(OFDMA), which generates independent subcarriers (TJ. In 
addition, multiple terminals can be assigned to each subcarrier 
using space division multiple access (SDMA) and multiple- 
input multiple-output (MIMO) techniques that manage co- 
terminal interference within the cell [2|. However, with base 
stations performing autonomous single-cell processing, the 
performance is fundamentally limited by interference from 
other cells — especially for terminals close to cell edges. 

The limiting inter-cell interference can be handled by base 
station coordination, recently termed network MIMO \ 3 1 and 
coordinated multi-point transmission (CoMP) |4|. By sharing 
data and channel state information (CSI) over the backhaul, 
base stations can coordinate the interference caused to ad- 
jacent cells, and cell edge terminals can be jointly served 
through multiple base stations [5]-[7|. The multicell capacity 
was derived in |8j, while more practical performance gains 
over conventional single-cell processing were reported in [9|- 
(TTJ under constrained backhaul signaling. In practice, the 
transmission optimization is also constrained by computational 
complexity and the difficulty of obtaining reliable CSI, which 
makes centralized implementations of multicell coordination 
intractable in large networks fTT) . 

In practical multicell systems, only a small subset of 
base stations will take the interference generated at a given 
terminal into consideration (to limit the backhaul signaling 
and complexity, and to avoid estimating negligibly weak 
channels). Fixed cooperation clusters were considered in |9j 
to (iteratively) coordinate transmissions within each cluster. 
While easily implementable for co-located base stations (such 
as sectors connected to the same eNodeB in an LTE system), 
the performance is still limited by out-of-cluster interference. 
A more dynamic approach was taken in p2) where base 
stations serve partially overlapping sets of terminals. Still, 
global interference coordination was assumed, making the 
approach infeasible in large systems. 

Resource allocation is very difficult to solve optimally, even 
under simplifying assumptions such as a single subcarrier, 
global interference coordination, and perfect CSI. A few type 
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of problems can be solved using uplink-downlink duality 
p3l-|[T6|, but weighted sum rate optimization and similar 
problems are all NP-hard | [T7[ . There are algorithms that find 
the optimal solutions 1 18 1, |19[, but with unpractically slow 
convergence. Suboptimal iterative solutions that perform well 
on synthetic channels have been suggested in (12| , J20)-J22). 
However, synthetic and real channels usually differ due to the 
simplifications used in the channel model assumptions. The 
performance of downlink multicell coordination has not been 
evaluated on measured channels, thus characteristics such as 
the correlation between channels from different base stations 
have not been considered |23|. 

Herein, we analyze coordinated multicell OFDMA transmis- 
sion, derive properties of the optimal resource allocation, pro- 
pose low-complexity strategies, and analyze the performance 
on measured channels. The major contributions are: 

• We propose a general multicell cooperation framework 
that enables unified analysis of anything from interference 
channels to ideal network MIMO. The main characteristic 
is that each base station is responsible for the interference 
caused to a set of terminals, while only serving a subset 
of them with data (to limit backhaul signaling). 

• Multicell OFDMA transmission and resource allocation is 
formulated as an optimization problem (jFTJ with arbitrary 
monotonic utility functions (e.g., representing data rates, 
error rates, or mean square errors), single user detection, 
and arbitrary linear power constraints. 

• Three properties of the optimal solution to ( |P1| | are 
derived: 1) Optimality of single-stream beamforming; 2) 
Conditions for full power usage; 3) An explicit precoding 
parametrization based on a few real-valued parameters 
between zero and one. This novel parametrization im- 
proves prior work in (24)-[28 1 by supporting general 
multicell and multicarrier systems and generally requiring 
much fewer parameters. 

• Two low-complexity strategies for resource allocation are 
proposed based on the three optimality properties and 
an efficient algorithm for subcarrier allocation. The cen- 
tralized strategy provides close-to-optimal performance, 
while the distributed version is suitable for large systems 
with many subcarriers and where the backhaul and com- 
putational resources required for the iterative solutions in 
(12), (20)-(22) are unavailable. 

• As the performance of any communication system de- 
pends on the channel characteristics, realistic condi- 
tions are necessary for reliable evaluation. Therefore, the 
proposed strategies are evaluated on measured channel 
vectors from a typical urban macro-cell environment. 
The impact of multicell coordination is evaluated both 
in terms of average performance and for fixed terminal 
locations, and the robustness to synchronization imper- 
fections is studied. 

Preliminary single-carrier results were reported in (29]. 



5(1), . . . , <S(|<S|) where |«S| is the cardinality. 



II. General Multicell System Model 

We consider a downlink system with K t transmitters, where 
transmitter j is equipped with Nj antennas. They communicate 
over K c independent subcarriers with K r receivers having 
one effective antenna each0The transmitters and receivers are 
denoted BSj and MS&, respectively, for j G J = {1, . . . , K t } 
and k € K = {1, . . . , K r }. 

In a general multicell scenario, some terminals are served 
in a coordinated manner by multiple transmitters. In addition, 
some transmitters and receivers are very far apart, making it 
impractical to estimate and separate the interference on these 
channels from the noise. Based on these observations, we 
propose a general multicell coordination framework: 

Definition 1. Dynamic cooperation clusters means that BSj 

• Has channel estimates to receivers in Cj C {l,...,K r }, 
while interference generated to receivers k £ Cj is 
negligible and can be treated as background noisej^] 

• Serves the receivers in T>j C Cj with data. 

This coordination framework is characterized by the sets 
Cj , T>j for all j E J, which are illustrated in Fig. [T] The 
mnemonic rule is that T>j describes data from transmitter j, 
while Cj describes coordination from transmitter j. How to 
select these sets efficiently, by only accepting the overhead 
involved in interference coordination and joint transmission 
if the expected performance gains are substantial, is a very 
interesting and important problem. The solution depends on 
the system architecture and is beyond the scope of this paper 
(see [4 1, 1 30 1-| 32 1), but a simple scheme would be to include 
MSfc in Cj if the long-term average channel gain from BSj 
is above a certain threshold. If the gain is above a second 
threshold, the terminal is also included in T)j. In principle, 
different Cj , T>j can be used for each subcarrier, but it is not 
necessary since subcarrier allocation naturally appears in the 
resource allocation analyzed herein. Observe that although 
Cj,Vj can be selected decentralized at transmitter j, some 
mechanism for coordination and data sharing is required 
between adjacent transmitters. 

At subcarrier c, the channel from BSj to MSfc is denoted 



NjXl 
T 



Notation: X T , ~K H , and denote the transpose, the 
conjugate transpose, and the Moore-Penrose inverse of X, 
respectively. Ijy and Ojv are N x N identity and zero 
matrices, respectively. If S is a set, then its members are 



h fec = [hj kc . . . hl tkc f e C N , where N = ^ jeJ Nj is the 
total number of transmit antennas. Based on the framework in 
Definition [T] only certain channel elements in h/- c will carry 
data and/or non-negligible interference. These can be selected 
by the diagonal matrices T> k e C NxN and C fe e C NxN 

1 This model applies to multi-antenna receivers that fix their linear receivers 
prior to transmission optimization. This case is relevant both for low- 
complexity transceiver design and as part of iterative transmitter/receiver 
optimization algorithms. 

2 This means that BSj has CSI to all users that receive non-negligible 
interference from BSj — a natural assumption since these are the users where 
BSj can achieve reliable channel estimates. But compared with autonomous 
single-cell processing, it requires additional estimation, feedback, and back- 
haul resources not necessarily available in all system architectures. 
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Fig. 1. Intersection between three cells. BSj serves terminals in the inner circle (Dj) and coordinates interference to terminals in the outer circle (Cj). 
Ideally, negligible interference is caused to terminals outside both circles. 



defined as 



The transmission is limited by L linear power constraints, 



D 



Di fc 




Cifc 







C K t k 



where Djfc = 



N , 



where C 



if k G T>j, 
otherwise, 

(1) 
if k G Cj, 
otherwise. 
(2) 

Thus, h^,Dfc is the channel that carries data to MSfc and 
hj^C/;. is the channel that carries non-negligible interference. 
If the received signal by MSfc at subcarrier c is denoted 
y kc G C, we have 



Ni 



Kr 



where Sfc c G 



Vkc 



*Nxl 



hf c Cfc]TD fe s fcc + 



(3) 



k=l 



is the data symbol vector to MSfc. This 



random vector is zero-mean and has signal correlation matrix 
Sfc c = E{sfc c s^}. In essence, resource allocation means 
selection of Sfc c . The special case of rank(Sfc c ) = 1 is known 
as single-stream beamforming and is particulary simple to 
implement |33|, but for now we let Sfc c have arbitrary rank in 
the performance optimization. 

The additive zero-mean term n kc has variance a\ r and it 
models both noise and weak interference from the transmitters 
with k £ Cj (see Definition [TJ. This assumption limits the 
amount of CSI required to model the transmission and is 
reasonable if transmitters coordinate interference to all cell 
edge terminals of adjacent cells. In the analysis, BSj is 
assumed to know the channels hj kc perfectly to all MSfc with 
k G Cj. 

In (|3), perfect synchronization is assumed between trans- 
mitters that jointly serve a terminal with data (synchronization 
uncertainty is considered in Section |VI) , Joint transmissions 
are also assumed to create synchronous interference [34|; this 
is reasonable for coordination between adjacent cells, while 
larger scales would require unacceptably long cyclic prefixes. 



^^tr{Q ; Sfc c } <q h 

k=l c=l 



(4) 



represented by matrices Q; >z On- To ensure that the total 
power is constrained and only is allocated to dimensions used 
for transmission, these matrices must satisfy two conditions: 
a) Q ;fe - Df QjfcDfc is diagonal and b) J2i=i Qlk >- ON^k. 
These constraints can, for example, be defined on the total 
transmit power (most power efficient), per-transmitter power 
(controls the radiated power in an area), or per-antenna power 
(protects the dynamic range of each power amplifier). It 
is straightforward also include subcarrier specific constraints 
(e.g., frequency masks), but that would complicate the nota- 
tion. 

A. Examples: Two Simple Multicell Scenarios 

The purpose of the proposed dynamic cooperation clusters is 
to jointly describe and analyze a variety of multicell scenarios. 
Typical examples are ideal network MIMO [7] (where all 
transmitters serve all terminals) and interference channels J26l 
(with only one unique terminal per transmitter): 

1 ) Ideal Network MIMO: All transmitters serve and coor- 
dinate interference to all terminals, meaning that Dfc = Cfc = 
Ijv for all k. If a total power constraint is used, then L = 1 
and Qi = Ijy. If per-antenna constraint are used, then L = N 
and Q; is only non-zero at the kh diagonal element. 

2) Two-user Interference Channel: Let BSfc serve MSfc and 
coordinate interference to the other receiver. Then, Di = 
r IjVl °l and D 2 = [° ° 1, while Ci = C 2 = 1^. If each 
transmitter has its own total power constraint, then L = 2 and 
Q ; = T>i for I = 1,2. 

III. Problem Formulation 

In this section, we will define two optimization problems. 
The main problem is to maximize the system performance 
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with arbitrary monotonic utility functions ( |P1| |, which in 
general is a non-convex problem. For this reason, we also 
consider the problem to maximize performance with individual 
quality constraints ( |P2| i, which is convex and thus efficiently 
solved using general-purpose optimization algorithms p5| 
and solvers p6) . In both cases, we make the assumption of 
single-user detection (SUD) [26], which means that receivers 
treat co-terminal interference as noise (i.e., not attempting to 
decode and subtract interference). This assumption leads to 
suboptimal performance, but is important to achieve simple 
and practical receivers. Under SUD, the signal-to-interference- 
and-noise ratio (SINR) for MSfc at subcarrier c becomes 

SINR° c L (S lc ,...,S^ c ) 



h kc C k D k S k c D k C k h fc c 
^ c +h£C fc (EDfcS £c D«)C£h fc 



fc^fc 

hf c D fc S fcc D^li, 



(5) 



kc 



<?l+KCk( £ D,S, c Df )Cf h k 

kei k 



where the last equality follows from CfcDfc = Dfc and that 
CfeDj, 7^ only for terminals fc in 



Ik — 



U 



(6) 



{j£j;k£Cj} 



This set contains all co-terminals that are served by the 
transmitters that serve MSfc. 

The achievable performance of a terminal can be measured 
in different ways, but the most common quality measures are 
all monotonic functions of the SINR: data rate, mean square er- 
ror (MSE), and bit/symbol error rate (BER/SER) (37). Herein, 
we describe the performance of MSfc by an arbitrary quality 
functionofc(-) of the SINRs that is strictly monotonically in- 
creasing^ each argument: ^(SINR^, . . . ,SINR°^J. The 
system performance represents a balance between maximizing 
the performance achieved by the different terminals. Herein, 
we represent it by an arbitrary system utility function /(•) of 
the terminal performances that is also strictly monotonically 
increasing. We arrive at the following general optimization 
problem: 

maximize / (51 , . . . , g Kr ) 

{ s fcc}fc=i, c =i 



subject to Sfc c >: Vfc, c, 



g k = g k ({SINR° c L (S lc , . . . , S^c)}^ 
^^tr{QjS fec } < qi VI. 



Vfc, 



fc=l c=l 

The system utility function /(■) can for example 
represent the weighted max-min terminal performance 
f(gi,...,gK r ) = min k g k /n k or the weighted sum 
performance f(gi, ■ ■ ■ ,gK r ) = SfcMfcSfc f° r some collection 
of weights (Ltfe > 0. Although many other functions are 
possible, it is worth noting that all reasonable performance 

3 We use the convention that the performance measure is a function to be 
maximized. Thus, if the problem is to minimize the error (e.g., MSE, BER, 
or SER), we maximize the error with a negative sign. 



optimizations can be expressed in terms of these utility 
functions]^] Observe that ( |P1| > solves the whole resource 
allocation (i.e., precoding and scheduling over subcarriers) 
as a single optimization problem. Thus, it is not unexpected 
that (|PT) is non-convex and generally NP-hard fT7) , making 
it very difficult to design numerical search algorithms with 
global convergence. 

The second optimization problem is designed to be easier 
to solve. It is based on satisfying predefined quality of service 
(QoS) constraints {7fc c }; that is, SINR^ > j kc for each 
terminal fc and subcarrier c. We pose it as the following 
feasibility problem: 

find {S fec }f^ c = i 
subject to SINR^(Si c , . . . , S KrC ) > lkc Vfc, c, 



(P2) 



]T^tr{Q z S fcc } < qi VI, 

k=l c=l 

Sfc c h Vfc, c. 

This problem is similar to the convex single-carrier problems 
in fT3")-fT6), but a main difference from ( |P2[ ) is that they 
multiply each qi with a parameter j3 and minimize over it. Such 
power minimization might lead to > 1, which corresponds 
to breaching some of the power constraints. Fortunately, the 
multi-carrier problem in ( |P2| > with fixed power constraints is 
also convex: 

Lemma 1. The constraints in ( |P2| ) can be formulated as semi- 
definite constraints and thus (|P2b is convex. 



Proof: The power constraints are already semi-definite 
and the QoS constraints can be reformulated in a semi-definite 

way as 



]T tr{ 7fcc Df Cfhfcch&CfcDfcSsJ 



fcei fc 



(7) 



tr{Df h fcc h£DfcSfc. c } < - 7 fc c CT 



kc 



by moving the denominator of SINR fec to the right hand side 
of the constraint and exploiting that & H h = tr{a ff b} = 
tr{ba ff } for any compatible vectors a and b. ■ 
The convexity makes it easier to analyze and solve ( |P2[ ). In 
Section IV ( |P1| | and |P2) are analyzed and common properties 
of their solutions are derived. The joint analysis is based on the 
following relationship, which is easily proved by contradiction: 

(PI) Lemma 2. If the optimal SINRs, SINR£° L , for each terminal 
fc and subcarrier c in ( |P1[ ) are used as QoS constraints j kc in 
( |P2| i, then all solutions to ( |P2| i are also optimal for ( |P1 [ >. 

Thus, any property of |P2) that holds for any set of 
{7fcc} also holds for |PTi. The price of achieving the convex 
problem formulation in (P2) is that the system must propose 
the individual QoS constraints. In general the optimal QoS 



4 Consider the region of achievable performance points (31, . . . ,gK r )- If 
this region is convex, there exists a supporting hyperplane ^Zfc Mfcfffc = 
constant for each point on the outer boundary [38 Theorem 1.5]. The weights 
)Xk of this hyperplane defines a weighted sum performance optimization that 
ends up in that point. Even if the region is non-convex, a line can be drawn 
from the origin to any point on the outer boundary and the direction of this 
line defines a weighted max-min terminal performance optimization that ends 
up in that boundary point. 
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values of (jPTJ are unknown, but the solution to (jFTJ can in 
theory be achieved by iteratively solving ( |P2| > for different 
QoS constraints. Unfortunately, the available algorithms either 
have slow convergence as in fl8) , fT9| or cannot guarantee 
global convergence as in fl2), |2l), [|22[. This motivates the 
search for properties of the optimal solutions that can simplify 
the optimization or be used to achieve efficient suboptimal 
solutions. 

IV. Properties of Optimal Resource Allocation 

In this section, we will derive three properties of the optimal 
solutions to fp"T] > and ( jP2"l >: 

• Optimality of single-stream beamforming; 

• Conditions for full power usage; 

• Parametrization of optimal precoding strategies. 
Taking these optimality properties into account when solving 
([FT} will greatly reduce the search space for optimal solu- 
tions. They are used in Section [V] to achieve low-complexity 
solutions that perform very well in the measurement-based 
evaluation in Section [VlJ 

A. Optimality of Single-Stream Beamforming 

The first optimality property of ( |P1[ ) and ( |P2[ ) is the suf- 
ficiency of considering signal correlation matrices Sk c that 
are rank one. This might seem intuitive and is often assumed 
for single-antenna receivers without discussion (see fT2) , fT3) , 
[16 1, |20|-|22|). However, the following toy example shows 
that it is actually not a necessary condition under general utility 
functions and power constraints (i.e., high-rank solutions can 
give the same performance, but never better, than the rank-one 
solutions): 

Example 1. Consider a single-carrier point-to-point system 
(K t = K r = K c = 1) with two transmit antennas, the 
channel vector hn = [1 0] T , and the per-antenna power 
constraints tr { [ °] S 1:L } < 1 and tr { [ ° ] Sn} < 1. Under 
these conditions, ( |P1[ ) is solved optimally by both the rank-one 
single correlation matrix Sn = [ I0 ] and by the rank-two 
matrix Sn = [ ]. 

To prove the sufficiency of rank-one signal correlation 
matrices, we start with a lemma from |39|. 

Lemma 3. The convex optimization problem 

maximize a^Xa 

x^o (P3) 

subject to tr{B,X} < bi for i = 1, . . . , M, 

with arbitrary Hermitian matrices B; >: and scalars bi > 
Vi, has solutions with rank(X) < 1. 

Proof: The proof is given in [39} Appendix III]. ■ 
The main result in this subsection is the following theorem. 

Theorem 1. It holds for (JFTJ and |P2| that 

• If the problem is feasible, there exists single-stream 
beamforming solutions (i.e., rank(S^ c ) < 1 Vfc, c). 

Proof: Let {S^ c } fe ^^ be an optimal solution to ( |P1| >. 
For each such optimal signal correlation matrix, we can create 



the problem 
maximize hf,D fc S fcc Df h fec 

subject to hf c C s D fc S fcc Df Cf h kc < z\ kc Vfc * k, (8) 
tr{Q ; S fcc } < q ikc VZ, 

where z\ kc = hf c C s D fe S* c Df Cf h kc and q lkc = 
tr{Q;S£ c }. This problem tries to maximize the signal gain 
under the constraint that neither more interference is caused 
nor more transmit power is used than with S* kc . Obviously, 
S* kc is an optimal solution to <j8j (if better solutions would 
have existed, these could have be used to improve the utility 
in ( |P1| |, which is a contradiction). 

Now, observe that <|8j has the structure of ( |P3[ ) 
in Lemma gl (since hf c CfeD fc S fec Df Cf h kc 
tr{Df Cf hfe c hf c C s D fe S fec }). Thus, if S* kc has rank greater 
than one, Lemma [3] shows that there exist an alternative 
solution S* k * with rank(S£*) < 1. This solution can be used 
instead of S* kc without decreasing the performance. The proof 
for ((P2J follows along the same lines. ■ 

The optimality of single-stream beamforming both de- 
creases the search space for optimal solutions and makes the 
solution easier to implement (since vector coding or successive 
interference cancelation is required if rank(Sfe c ) > 1 (33]). 
Observe that rank(Sfc c ) < 1 in Theorem [TJ implies that the 
rank might be zero, which corresponds to Sfe c = (i.e., no 
transmission to MS/- at subcarrier c). 

Recently, similar optimality results for single-stream beam- 
forming have been derived for a few special multicell scenar- 
ios. The MISO interference channel was considered in [261 
and a certain class of multicell systems was considered in [28 1. 
Per-transmitter power constraints were considered in both | |26[ 
and J28) , making Theorem [TJ a generalization to arbitrary linear 
power constraints and our general multicell framework. 

B. Conditions for Full Power Usage 

If only the total power usage over all transmitters is con- 
strained, it is trivial to prove that the solution to ( |P1| | will use 
all available power. Under general linear power constraints, 
it may be better to not use full power at each transmitter or 
antenna; there is a balance between increasing signal gains 
and limiting co-terminal interference. This is illustrated by the 
following toy example: 

Example 2. Consider a two-user interference channel with 
single-antenna transmitters (Kt = K r = 2, N\ = N2 = K c = 
1) and the channel vectors hn = [1 -\/l/10] T and h.21 = 
[\Jl/2 1] T . BSj transmits to MSj and coordinates interference 
to both terminals, meaning that Di = D2 = [°°], and 

Ci = C2 = 12- The per-transmitter power is constrained as 
tr{Dj-Sji} < 20 Vj. 

Under max-min rate optimization with f{SINR\ 1 SINR2) — 
minfelog 2 (l + SINRk) and identical quality functions, the 
optimal solution to ( |P1| > is Sn = 10Di and S21 = 2OD2. 
This solution gives SINR\ = SINR2 = 10/3, and observe that 
only BS2 uses full power. If BSi would increase its power 
usage, then SINR2 would decrease and thus the performance 
would be degraded. 
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In principle, the knowledge that a certain power constraint is 
active removes one dimension from the optimization problem. 
The second optimality property provides conditions on when 
full power should be used in general multicell systems. 

Theorem 2. It holds for ((FT} and {P2} that 

• There exist solutions that satisfy at least one power 
constraint with equality. 

• If there are only per-transmitter power constraints, BSj 
should use full power if \Cj\ < Nj. 

Proof: For a given optimal solution {S^}^!.'^"-., as- 
sume that all power constraints in (HI are inactive. Let e — 
max; qi/(J2k=£ E c =i tr iQ* S L/) and observe that e > 1. 
Then, {eS£ c } fe _^ c c =1 will satisfy all power constraints and 
at least one of them becomes active. The performance is not 
decreased since the factor e can be seen as decreasing the 
relative noise power in each SINR in {5}. Thus, there always 
exist a solution that satisfies at least one power constraint with 
equality. 

The second part requires that there exists a k £ T>j with 
h jkc # span(Ufc eCjA{fc} {h 3 fc c }). If \Cj\ < Nj, this is satisfied 
with probability one for stochastic channels (with non-singular 
covariance matrices). Then, it exists a zero-forcing beamform- 
ing vector that can increase the signal gain at MS/- without 
causing interference to any other terminals in Cj \ {k}. By 
increasing the power in the zero-forcing direction until the 
power constraint for transmitter j is active, the second part of 
the theorem is proved. The details are along the lines of (25] 
Theorem 2], ■ 

The interpretation is that at least one power constraint 
should be active in the optimal solution. In addition, the fewer 
terminals that a transmitter coordinates interference to, the 
more power can it use. The second item in Theorem [2] can 
be relaxed to that full power is required at BSj if fewer than 
Nj terminals in Cj are allocated to some subcarrier c. 

C. Parametrization of Optimal Precoding Strategies 

The third optimality property is an explicit parametrization 
of the optimal solution to (jPTJ using K r K c + L parameters 
between zero and one. In this context, explicit means that the 
parameters give a transmit strategy directly, without having to 
solve any optimization problemfj Recalling that {FT} consists 
of finding K r K c complex-valued positive semi-definite trans- 
mit correlation matrices of size NxN, this constitutes a major 
reduction of the search space for the optimal solution. 

As a first step, we exploit the single-stream beamforming 
optimality in Theorem [TJ to derive a dual to the feasibility 
problem {P2}. The following lemma builds upon the line 
of work in fT3)-|fT6) and characterizes the solution to {P2} 
through the principle of virtual uplink-downlink duality. To 
simplify the notation, we first define I k as the set of terminals 
that transmitters serving MSfc are coordinating interference to. 
Formally, 

T k = |J Cj\{k}. (9) 

5 Fewer parameters can be achieved by using the K r K c QoS constraints 
{7fec} m |P2} as parameters (see [19 ]), but this is impractical since finding the 
corresponding transmit strategy means solving a convex optimization problem. 



Lemma 4. Strong duality holds between ( |P2| > and the Lagrange 
dual problem 



K r K c L 

maximize ^ ^ A fcc o^ c - ^ 
„Ji k=1 c=1 /=1 

subject to u>i > 0, Xkc > Vfc, c, I, 

,yv KrC ,u:,X c 



with u> = [u>x, . 

,VUL/ 



maxSINR^ L (wi c 



■ , w L ] T , A c = [Ai 



Ky 



(D2) 

= 7 fec , Vfe, c, 
] T , and 



SINR^ L (w lc ,...,w Kl . c ,u;,A c ) 

A fce wf c Dfh fcc hf c D fc w fcc 

wf c (I>Qi+ E Afe c Df Cf h Sc hf c C s D fc )w fec 



(10) 



k£l k 



The optimal utility of ( |D2[ ) satisfies Y2k=i Z~2f=i ^kcO~ 



T kc 

J2iLi u>l( l l ~ *-* and ^ e °pti ma l Sfc c in ( |P2[ ) is equal to the 
optimal WfcW^ up to a scaling factor. 

Proof: The proof is given in Appendix [A] ■ 
The main result in this subsection is the following theorem 
that exploits Lemma 4] to derive an explicit parametrization 
for the solution to {P^T 

Theorem 3. The optimal solution to {FT} can be expressed as 
Sfcc =J3fe c v fec vf c . For some choice of bj u \ kc <E [0,1] (I = 
1, . . . , L, k — 1, . . . , K r , c = 1, . . . , K c ), the optimal power 
allocation p kc and beamforming vectors v kc are given by 

t 



Vfcc 



fcex fc 



(Ef=i^Q/+ E Afc c DfCfhfc c hf c C s D fc ) Dfh fcc 



kei k 



(11) 



for all k, c and 

[Pic ■ • -PK r c] = [ 7lc <r? c . . . 7K pC 4 r J Ml Vc. (12) 



Here, the mnth element of M c S 



p K r x K r 



[M c 



\h H D v I 2 

\ 11 mc ±J rn v mc| j 



m = n, 



-7„ c |h^ c C„D m v mc | 2 , m^n, 



(13) 



and 



Ik, 



= A fcc hf c D fe (^ Wi Q 



A Sc Df Cf h, c hf c C,D fe ) f Df h fcc . 



(14) 



kei k 



Proof: Based on Lemma|2j there exists a set of {~/ kc } such 
that the solutions to {P2} are solutions to {FT}. Using these 
{jkc}> we a Pply Lemma EJ to achieve v k as the normalized 
solution to the generalized eigenvalue problem in {D2}. To 
determine p kc for all k and c, observe that since we consider 
the optimal solution to {FT}, all QoS constraints in {P2} needs 
to be satisfied with equality. These constraints gives K r K c 
linear equations that can be expressed and solved as in {12} . 
The alternative expression for j kc in {T5} is achieved from 
{34} in Appendix [A] 
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It remains to show that it is sufficient to search for Lagrange 
multipliers w;,Afc c G [0,1]. Dual feasibility in ( |D2| ) requires 

, Xkc > 0. Observe that ( fTT| and ( fT2| i are unaffected by 
a common scaling factor in w/,Afe c . Thus, if any of the 
variables is greater than one, we can divide all the variables 
with d = max(max/ oj;, maxfc iC Afc c ) and achieve a set of 
parameters between zero and one. Thus, it is sufficient with 
u h \ kc €[0,l]. M 

This theorem shows that the whole resource allocation in 
([FT} (i.e, precoding and scheduling over subcarriers) is gov- 
erned by K r K c + L real-valued parameters, each between zero 
and one. Thus, even if multiple base stations are involved in 
the transmission, there is only a single parameter per terminal 
and subcarrier. Remarkably, this simple structure holds for any 
outer utility function /(•) and terminal quality functions <?&(•) 
that are strictly monotonically increasing. 

The parameter \k c in Theorem [3] represents the priority of 
MS fc at subcarrier c. This is easily understood from ( fT3j ) by 
observing that 



d 



fee 



SINR^ = 




(15) 



Clearly, Afe c > only for terminals allocated to subcarrier c. 
In addition, all SINRs are decreasing functions of uji, and it 
is worth noting that oji = for all inactive power constraints. 

Similar parametrizations have been derived for single-carrier 
systems in |24)-p8). For i^ r -user MISO interference chan- 
nels, a characterization using K t (K r — 1) complex-valued 
parameters was derived in J24) . It was improved in pTJ by 
making them positive real-valued, and even earlier in |26| 
by using K t (K r — 1) parameters from [0,l]j^For multicell 
systems, K t (K r — 1) complex-valued parameters were used 
in p5| , which was improved to [0, 1] -parameters in |28) . 
Compared with this prior work, our new parametrization in 
Theorem [3] generally requires much fewer parameters; for 
instance, K t + K r instead of K t (K r — l) in MISO interference 
channels and other multicell systems with per-transmitter 
power constraints (i.e., K c = 1,L = K t ). In other words, 
the number of parameters increase linearly instead of quadrat- 
ically. However, it is worth noting that the parametrizations in 
[26|-[28| are superior in the special case of K t = K r = 2, 
simply because Theorem [3] handles any power constraint while 
only per-transmitter constraints are considered in [26|-|28|. 

Main applications of Theorem [3] are to search for parameter 
values iteratively or to select them heuristically. An example 
of the former is the search algorithm in |27], which cannot 
guarantee global convergence but satisfy a necessary condition 
on optimality. The well-known signal-to-leakage-and-noise 
ratio (SLNR) beamforming strategy in |34") , pO) corresponds 
to a certain heuristic selection of our parameters, and the 
regularized zero-forcing approach in [41] resembles the op- 
timal structure in Theorem [3] Thus, the theorem explains why 
these strategies perform well and demonstrates that even better 
performance can be achieved by fine-tuning the parameters. 

6 From a complexity perspective, there is little difference between parame- 
ters from [0, 1] and [0, oo) since it exists bijective mappings h : [0, oo) — > 
(0, 1] between these sets (e.g., h(x) = e~ x ). However, a bounded set as 
[0, 1] is more neat to use. 



V. Low-Complexity OFDMA Resource Allocation 

The resource allocation problem in is generally NP- 
hard, making the optimal solution practically infeasible. We 
will therefore propose low-complexity strategies for OFDMA 
resource allocation that exploit the optimality properties in 
Section [IV] Despite the huge reduction in computational 
complexity, these strategies provide close-to-optimal perfor- 
mance in the measurement-based evaluation of Section [VlJ In 
addition, the optimal multiplexing gain is achieved in certain 
scenarios. 

Theorem [3] parameterized the optimal solution to ( |P1) , thus 
good performance can be achieved by judicious selection of 
the parameters Afe c and u>i. An important observation is that 
{FT} allocates terminals over subcarrier as an implicit part 
of the optimization problem. In the parameterization, this 
scheduling is explicitly represented by having Afe c > for all 
terminals k that are scheduled on subcarrier c. Thus, heuristic 
parameter selection requires an efficient subcarrier scheduling 
mechanism. Herein, we adopt and extend the state-of-the-art 
ProSched algorithm from BTJ. 



A. Centralized Resource Allocation 

For notational convenience, we only consider per- 
transmitter power constraints, weighted sum performance 
f(gi, ■ ■ -,gK r ) = J2k Mfc5fe. and quality functions g k {-) that 
can be decomposed as 



9k 



({SINR^f:, 



z2gkc 

c=l 



(SLNRt 



(16) 



where all gkc(-) are concave functions. This structure holds 
for both data rates, mean square errors (MSEs), and symbol 
error rates (SERs), as will be shown below. 

As the parametrization in Theorem [3] builds upon virtual 
uplink optimization, we call our strategy centralized virtual 
SINR (CVSINR) resource allocation. It is outlined as follows: 

1) Consider weighted sum optimization f(gi,... ,gK r ) = 
Sfe Mfefffc f° r some collection of weights fi^ > 0. 

2) Allocate terminals S c C JC to subcarrier c using an 
appropriate algorithm (e.g., ProSched (3TJ). 

3) Set A fec = lik\Sc\/{ol c Y.kes c Ms) if MS fc is scheduled 
on subcarrier c, otherwise set \ kc = 0. 

4) Set oji = K c /qi and calculate the signal correlation 
matrices Sfc c using Theorem [3] 

5) Rescale all Sfc c , according to Theorem [2] to satisfy all 
power constraints (and some with equality). 

This CVSINR strategy allocates terminals to subcarriers 
and then calculates a single-stream beamforming strategy in 



compliance with the optimality properties in Section IV The 
parameters uji , Xk c are selected heuristically, and the details 
are provided later in this section. 

B. Distributed Resource Allocation 

The drawback of any centralized strategy, including the 
proposed CVSINR strategy, is that resource allocation requires 
global CSI. In a system with many transmitters/receivers and 
many subcarriers, this requires huge amounts of backhaul 



IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 59, NO. 12, DECEMBER 2011 



8 



signalling. In addition, joint CSI processing typically means 
large computational demands. Therefore, our main focus is to 
derive a low-complexity distributed version of CVSINR based 
on local CSI. It will be suboptimal in terms of performance, 
but have much more reasonable system requirements than 
centralized approaches. 

Under single-stream beamforming, we have 
and divide the collective beamforming vectors as 



w fec wf c 



y/PK t kcVK tkc ] ■ Here ' v jfcc G 



is the 



unit-norm beamforming vector and pj kc > is the power 
allocated by BSj for transmission to MS& (only non-zero for 
k 6 T>j). With this notation, ( |P1| | becomes 



Kr 



K c 



maximize 

, -,K t ,K r ,K c 

t -,K t ,K r ,K c 
iPjkci j = lk = i c= i 

subject to 



^2^k^2 he 



(SINR 



kc j 



(17) 



k = l 



SINR 



DL 

kc 



Pjkc > 0, \\v jkc \\ = 1 Vj, fc,c, 

C,;D < v ,, 



Pjkc n 'jkc y ^ikl-' J k Vjkc | 




Vfc, C, 



^2^2pjkc < qj vj. 

fc=l c=l 

An important question is how to maximize SINR^ in ( fl7j ) 
distributively using only local CSI. Starting with the numera- 
tor, coherent signal reception can still be achieved, for instance 
by requiring that v /p :( -fc c hj| c D j^v jkc should be positive real- 
valued for every transmitter. Achieving coherent interfer- 
ence cancelation (i.e., that | J2 je J V^ h jfec C jfe D ifc v jfec| 2 
is small without enforcing that every term is small) is more 
difficult under local CSI, if not impossible in large multiuser 
systems]^] Without coherent interference cancelation, there are 
few reasons for joint transmission; it is more power efficient 
and reliable to only use the transmitter with the strongest link, 
although somewhat more unbalanced interference patterns are 
generated. Therefore, each terminal is only served by one 
base station at each subcarrier in our distributed strategy — but 
different transmitters can be used on different subcarriers. This 
assumption greatly reduces the synchronization requirements, 
while the performance loss is small or even nonexistent (see 
Section |VI) , 

The resource allocation problem in ( fT7| ) can be divided 
into three parts: 1) Subcarrier allocation; 2) Power allocation 
{Pjkc}', and 3) Beamforming selection {vjkc\- Our distributed 
strategy solves them sequentially, only requiring local CSI 
at each transmitter in each part (i.e., h,-fc c is known at BS^ 
for terminals k G Cj). In between each step, a small amount 
of signaling is used to tell each transmitter which terminals 
that are served by adjacent transmitter (to enable interference 
coordination). Next, we describe the three steps in detail. 

C. Step 1: Subcarrier Allocation 

The goal of this step is to select the scheduling set S n (j, c) 
with terminals that are served by BSj at subcarrier c. The 

'iterative optimization can be used, but it requires backhaul signaling and 
is sensitive to CSI uncertainty, delays, etc. 



subscript n denotes the current scheduling slot. Observe that 
the performance of subcarriers is only coupled through the 
power constraints. Thus, it is reasonable to perform indepen- 
dent user scheduling on each subcarrier. The proposed scheme 
is a generalization of the ProSched algorithm in [31] and |42|, 
where the interference generated on terminals served by other 
transmitters, 

A n (j,c) = \J(S n (i,c)nC j ), (18) 

is also taken into consideration. For a given set A, the 
scheduling metric for terminal k is 



/a \\h H p( S - A )||2 > 



Ijkc 



Hk9kc 



4 c Kc\S\ 



(19) 



where 'f^ denotes the projection matrix onto the null space 

of channels for terminals in (<S U A) \ {k}. Thus, r]^.'^ 
represents the performance with equal power allocation and 

zero-forcing precoding. To lower the computational complex- 

~ (s) 

ity, the ProSched algorithm calculates P^ us i n g an efficient 
approximation (see |42|). In the search for the scheduling set 
S with the highest sum metric 



(S,A) _ (S,A) 
kes 



(20) 



the ProSched algorithm avoids the complexity of evaluating 
it for every possible S C Cj by performing a greedy tree 
search. Despite all simplifications, ProSched has shown good 
performance under reasonable complexity |31|, [42 1. Our 



distributed ProSched algorithm exploits time correlation and 
selects S n (j,c) as follows: 

1) Start with S*™ p (j, c) — S n -i(j,c) and knowledge of 

A n -l(j, c). 

2) Use the "Tracking and Adaptivity"-procedure in [42 1 to 
add and remove terminals from S l ™ p (j,c), while zero 
interference is generated to terminals in A n -i(j, c). The 
final set needs to satisfy \ST p (j, c)UA n -i[j, c)| < 
The sum metric is evaluated using (20) . 

3) Set S n (j,c) = S l ™ p (j,c) and send it to central station. 

4) Central station calculates A n {j, c) and sends it to BSj. 

The main difference from the original ProSched algorithm is 
the existence of A n -i{j, c), which are terminals that BSj must 
coordinate interference towards. The algorithm exploits time 
correlation by taking new decisions based on previous ones; it 
tries to remove users from the selected set and then add other 
users. The user weights can updated between scheduling slots, 
although not reflected in our notation. The initialization can 
be achieved in some arbitrary way, for example by selecting 
the strongest user as S n (j, c) = argmax^g-p^-j^' \ The last 
step of the algorithm prepares for the next scheduling slot and 
is used in the next steps to adapt the precoding to the current 
subcarrier allocation. 



8 A feature of the approximate zero-forcing precoding is that the sum metric 
will be non-zero also for \S'™ P (j , c)UA n -i(j , c)\ > Nj , but this corresponds 
to an interference-limited system and should be avoided. 
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D. Step 2: Power Allocation 

The difficulty in distributed power allocation is that the 
interference powers generated by other transmitters are un- 
known. Fortunately, the proposed subcarrier allocation is de- 
signed to make \S n (j,c)\JA n (j,c)\ < Nj so that zero-forcing 
precoding exists)^] Power allocation based on zero-forcing 
simplifications has been shown to provide accurate results 
(e.g., in the context of the ProSched algorithm), although 
better beamforming vectors are used in the end. Based on this 
assumption, the SINR of MSfc at subcarrier c becomes 



SINR° C L = Pjkc 



H ZF 12 
jkc jkc\ 



VkeS n (j,c) (21) 



kc 



-Pjkc 



where vW c is the unit-norm zero-forcing vector for terminals 
in S n (j, c) U A n {j, c). For fixed Pj kc , the distributed power 
allocation can be solved as follows. 

Lemma 5. For a given transmitter index j, some given channel 
gain constants pj kc > 0, and differentiable concave functions 
9kc{ ) with invertible derivatives, the power allocation problem 

A',. 



maximize Y] V* PkhdPjkcPjkc) 

c=i fces„o,c) 



A',. 



(22) 



subject to 

is solved by 

Pjk 



Po^ < qj 

c=i kes n (j,c) 



1 



max i — s'kc , 

Pjkc \PkPjkc 



V 



,0 



(23) 



where -^g kc {x) = g'k c ( x ) an d ^ > is selected to satisfy the 
constraint with equality. 

Proof: This convex optimization problem is solved by 



standard Lagrangian techniques |35|. ■ 
The distributed power allocation depends on the inverse 
of the derivative of the terminal quality function gkc(')- To 
exemplify the structure, we let the quality functions either 
describe the data rate, MSE, or Chernoff bouncf*"] on the SER 
for an uncoded M-ary modulation: 



log a (l + s) 



1 



a'k/iv) = J - !. 
l 



i 



he (y) = 



Vv 



(24) 



(25) 



TABLE I 

Distributed Virtual SINR (DVSINR) Resource Allocation 



set power threshold r. 

for each transmitter j at scheduling slot n: 

set ST P (j, c) = S n -i(j, c). 

perform the "Tracking and Adaptivitv"-proc edure [42] 



V-C 



on S l n V (j, c) with the special rules in Section 
set S n (j, c) = cSjJ" p (j, c) and send it to central station, 
attain An(j, c) from central station, 
calculate Pjkc f° r k S S n (j,c) using Lemmapl 
remove terminals with pjk c < r from S n (j, cy 
send updates of S n (j,c) and attain updates of A n (j,c)- 
calculate uij and Afc c using \29\ and j28j. 
calculate Vj^ c for k £ S n (j, c) using Corollary [T] 
end 



In ( p6] i, z — 3/(M 2 — 1) for pulse amplitude modulation 
(PAM), z = sin 2 (7r/M) for phase-shift keying (PSK), and 
z — 3/ (2M — 2) for quadrature amplitude modulation (QAM). 

For all the listed quality functions, the power allocation 
in Lemma [5] has the waterfilling behavior, which means that 
terminals receive more power on strong subcarriers than on 
weak. In addition, the system prioritizes terminals with large 
weights. Some terminals might be allocated zero or negligible 
power (below some threshold r). These terminals should 
immediately be removed from the scheduling set S n (j, c), and 
adjacent base stations should be informed so that all A n (j, c) 
can be adjusted. This requires some extra signaling, but will 
avoid unnecessary interference coordination and improves the 
scheduling in the next slot. 

E. Step 3: Beamforming Selection 

The parametrization in Theorem [3] provides the optimal 
structure of the beamforming directions. Since at most one 
transmitter serves each terminal at each subcarrier, we have 
the following corollary. 

Corollary 1. Assume that all S n (j,c) are disjunct and that 
BSj has a per-transmitter power constraint of qj. For BSj, 
the optimal beamforming direction to user k £ S n (j, c) at 
subcarrier c is given by 



(wjIjv, + ±2 X- kc h fkc hf ic 



V 7 -fe c 



kei k 



kc 



keik 



>-jkc 



(27) 



for some positive parameters cjj and {^kc}^^" 
To use Corollary [TJ the parameters 



l,c=l' 

and 



9 Since the subcarrier allocation makes |<S„ p (j',c) U A n —i{j, c )\ < Nj 
with A n -i{j, c) instead of A n (j,c), it might occasionally happen that 
\Sn(j,c) U An(j, c)| > Nj. This is either handled by having a central 
mechanism that removes users or by ignoring the weakest unserved terminals 
in An{j, c) in the power allocation step. The latter will have limited impact 
on performance since most of the interference coordination comes from the 
beamforming directions and not from power allocation. 

10 The exact SER can also be used, but there are no closed-form expressions 
for the inverse of its derivative. 



Aj. c need 

(M — l)z to be selected heuristic ally. For this reason, recall that the 
My parametrization is achieved using uplink-downlink duality. 

(26) Thus, utj is inversely proportional to the SNR of the virtual 
uplink channels. As the parameter is user-independent, it is 
only affected by the transmit power of BSj and not of any 
noise parameter. We therefore select 



(heuristic) 



(28) 



Next, we consider X kc and recall from ( fT5] l of Section IV 



that X kc represents the priority of MSjg, and whether or not 
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the terminal is scheduled at subcarrier c. The best priority 
indicators that we have are the weights /ik in (Vl) , but we need 
to normalize them based on which users are scheduled. Finally, 
A^, c should be inversely proportional to the noise power <r| 
of MS;., since this term could not be included in the user- g 
independent ujj -parameters. We therefore select 



A 



(heuristic) 



if k G SAn (j, c) 
otherwise 



(29) 

where SA n (j,c) = S n (j,c) U A n (j, c), for notational conve- 
nience. The normalization was performed such that the pro- 
posed scheme reduces to the well-studied SLNR beamforming 
strategy in [34) if all user weights are identical and all noise 
terms are identical. Observe that different transmitters can have 
different heuristic values on A^ h c eullstlc ' 1 , representing the local 
terminal priority. 



F. Final Strategy 

The proposed distributed resource allocation strategy is 
summarized in Table [I] The strategy is named distributed 
virtual SINR (DVSINR) resource allocation, since it based on 
optimization of virtual SINRs as in Lemma |4] In addition, it 
reduces to the DVSINR beamforming scheme in ]25) in certain 
single-carrier scenarios. 

The proposed CVSINR and DVSINR strategies are both 
suboptimal, but for a given subcarrier scheduling they can 
achieve asymptotic optimality in terms of multiplexing gain: 

Theorem 4. Let S c C {1, . . . , K r } be the terminals scheduled 
for transmission on subcarrier c. If |<S C n Cj\ < Nj for all j 
and c, then the CVSINR and DVSINR strategies achieve the 
multiplexing gain of \&c\ (with probability one). 

Proof: The theorem follows by the same approach as in 
1 25 Theorem 5] and exploits that random channels are linearly 
independent with probability one. ■ 
This means that the weighted sum rate behaves as 



Ec=i l^c|) log 2 (-P) + constant at high transmit power P. 
Thus, the absolute performance losses (also called power 
offsets) of the CVSINR and DVSINR strategies are bounded 
compared with the optimal solution, and the relative loss goes 
to zero as constant/ log 2 (P) with increasing transmit power. 

VI. Measurement-Based Performance Evaluation 

The theoretical performance of coordinated multicell trans- 
mission has been thoroughly studied for single- and multi- 
carrier systems (see e.g., |6j, fl2) , (2TJ, p2|). Under Rayleigh 
fading channels and perfect synchronization, large improve- 
ments over single-cell processing have been reported. Es- 
pecially, cell edge terminals benefit from inter-cell interfer- 
ence coordination. However, results obtained from numerical 
simulations are highly dependent on the assumptions in the 
underlying channel models. For example, it is common to 
model the channel characteristics between a terminal and 
multiple base stations as uncorrelated, although correlation ap- 
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(a) Routes in Evaluation Scenario A. 
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pears in practice |23 1. Along with other idealized assumptions 
(e.g., on fading distributions and path losses), such channel 
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(b) Routes in Evaluation Scenario B. 

Fig. 2. Downlink evaluation based on measurements in an urban environment. 
Two four-antenna base stations are serving multiple single-antenna terminals. 
These terminals are either randomly located on the measured routes marked 
in Scenario A or move along fixed routes as in Scenario B. 



dependencies may affect the performance of any multicell 
system. 

The purpose of this section is to evaluate the performance 
of the low-complexity strategies in Section [V] on realistic 
multicell scenarios based on channel measurements. We only 
consider a single subcarrier for computationally reasons (the 
optimal solution to (JFTJi can be calculated systematically using 
p9| ) and since our measurements are flat-fading. However, the 
subcarriers in weighted sum rate optimization are only coupled 
by the power constraints and OFDMA systems are known for 
giving almost flat power allocation over subcarriers (43). Thus, 
we expect our single-carrier results to be representative for 
general OFDMA systems. 

The channel data was collected in Stockholm, Sweden, 
using two base station^] with four-element uniform linear 
arrays (ULAs) with 0.56A antenna spacing and one terminal. 
The terminal had a uniform circular array (UCA) with four 
directional antennas, but herein we average the signal over its 
antennas to create a single virtual omni-directional antenna. 

"Channel data from a third base station, co-located with BSi, was also 
measured in [ 23]. However, the overlap in coverage area between this base 
station and BSi and BS2 in Fig.|2]is small, and therefore it is not used herein. 



IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 59, NO. 12, DECEMBER 2011 



11 



90 



3 


80 






!q 


70 


<B 




rr 


60 


Sum 


50<> 






Weighi 


30 


•age 


20 


Avei 


10 



■ Optimal 
■CVSINR 
— 8 — Optimal (incoherent) 

DVSINR 

Coordinated ZF 

Single-cell process. 



10 12 14 
Total Number of Terminals 



16 



1£ 



20 



(a) 20 dBm output power per transmitter. 




■0 — Optimal 
CVSINR 
■3 — Optimal (incoherent) 
DVSINR 
Coordinated ZF 
Single-cell process. 



6 8 10 12 14 16 

Total Number of Terminals 

(b) dBm output power per transmitter. 



20 



Fig. 3. Average weighted sum rate (over random terminal locations) for 
Scenario A with different number of terminals. The performance is shown for 
different resource allocation strategies. 



The system bandwidth was 9.6 kHz at a carrier frequency 
in the 1800 MHz band. The measurement environment can 
be characterized as typical European urban with four to six 
story high stone buildings. Fig. [2] shows the measurement area 
with roads illustrated in light gray and blue routes showing 
the GPS coordinates of the terminal locations used for the 
simulations herein. Further measurement details are available 



in \ 23\. The collected channel data is utilized to generate two 
evaluation scenarios where terminals are moving around in the 
area covered by both base stations: 

• Scenario A: The performance behavior is evaluated over 
different random terminal distributions. In each snapshot, 
terminals can be located anywhere on the measured routes 



in Fig. 2(a) with uniform probability. To create balance, 
two terminals are placed to have their strongest channel 
gains (||hjfc|| 2 ) from BSi, and the same for BS2- 
Scenario B: To study the impact of coordination on 
individual terminals, four terminals are now placed at 



certain locations and moved as indicated in Fig. 2(b) 
In both scenarios, the performance will be evaluated as a 
function of the output power per base station (in dBm). The 
noise level is set to —131 dBm (i.e., thermal noise and a few 
dBs of transmitter noise) and the measured path losses from 
the strongest base station varies between —37 dB and —85 dB 
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Fig. 4. Cumulative distribution function of the individual terminal rates (over 
random terminal locations) for Scenario A with K r = 4. The performance 
is shown for different resource allocation strategies at dBm and 20 dBm 
output power. 



for different terminal locations in Fig. [2] 

The performance measure is the weighted sum rate with 

C„/E{log 2 (l + j£±r maxj P 3 ■ || 1 1 2 ) >, where c w is a 



/'A- 



scaling factor making Ylk=i Mfc ~ ^r- This represents propor- 
tional fairness (with equal power allocation). Six transmission 
strategies are compared: 

1) The optimal solution (calculated using the framework in 



2) 



1 19 1 and using the optimization software CVX (36].). 
The optimal solution under incoherent interference re- 
ception, where base stations cannot cancel out each 
other's interference using joint transmission. This case is 
relevant since very tight synchronization and long cyclic 
prefixes are necessary to enable coherent interference 



cancelation over wide areas [34]. We represent it by 
replacing the SINR in (Jl7]i with 

1 E VPj fcch jfec D jfc v jfcc| 2 



SINR 



DL- incoherent 

kc 



jk jkc | 

(30) 

3) Centralized virtual SINR (CVSINR) strategy, proposed 
in Section IV-AI 

4) Distributed virtual SINR (DVSINR) strategy, proposed 
in Section IV-BI 

5) Coordinated ZF precoding with single-cell scheduling 
using ProSched. 

6) Single-cell processing as if there is only one cell in the 
system. The average out-of-cell interference is included 
in the cr^-terms. The resource allocation is based on the 
DVSINR strategy (pretending that K t = 1). 



A. Results for Evaluation Scenario A 

The scheduling performance is evaluated in Fig. [3] over 
different random terminal locations, each used for 10 channel 
realizations. The average weighted sum rate is given as a 
function of the total number of terminals at 20 dBm and dBm 
output power per transmitter. The proposed CVSINR strat- 
egy provides close-to-optimal performance, especially when 
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Fig. 5. Average weighted sum rate as a function of the output power 
for Scenario B. The performance is shown for different resource allocation 
strategies, including the proposed CVSINR and DVSINR strategies. 



Fig. 6. Average individual terminal rates for Scenario B with and without 
multicell coordination. The proposed DVSINR strategy (triangles) is compared 
with single-cell processing. 



the number of terminals increases. The gap to the optimal 
solution is remarkably small, given that CVSINR is a simple 
combination of ProSched scheduling and heuristic use of the 
optimality properties derived in Section [IV] — further param- 
eter tweaking can certainly reduce the gap. The distributed 
strategies (DVSINR and coordinated ZF) stabilize on about 
half the performance of the centralized schemes, representing 
that only half the number of terminals can be simultaneously 
accommodated. One might think that this is due to that 
only one transmitter serves each terminal, but the actual 
explanation is that (non-iterative) distributed schemes cannot 
achieve coherent interference cancelation. This is understood 
by the comparably small difference from Strategy 2 which 
includes joint transmission but have incoherent interference 
reception, and it confirms the discussion in Section V-B 
All the studied centralized and distributed strategies provide 
improvements over single-cell processing, and the differences 
increase rapidly with the output power. 

Next, we want to study how multicell coordination impacts 
the performance of each terminal and we set K r = 4 to make 
sure that all six strategies consider the same set of terminals. 
In Fig. [4] the cumulative distribution functions (cdfs) of the 
individual terminal rates are given for output powers dBm 
and 20 dBm. The proposed CVSINR strategy is very close 
to the optimal solution, in particular at high output power. 
The difference between the optimal solution and the DVSINR 
strategy increases with the SNR, but the distributed approach 
is close to the optimum under incoherent interference, which 
might be the most reasonable upper bound in practice [34|. 
Both CVSINR and DVSINR provide great improvements over 
single-cell processing — especially at high output power. The 
coordinated ZF scheme performs poorly at low output power, 
but approaches DVSINR at higher power. To summarize, 
terminals that move around in the cell clearly benefit from 
multicell coordination on the average through better statistical 
properties. In Scenario B, we will however see that terminals 
that are fixed at certain locations may experience performance 
degradations. 
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Fig. 7. Weighted sum rate for Scenario B as a function of phase standard 
deviation a^ c (at 20 dBm output power). The actual channels are modeled as 
h jfe e^, where e Af(0, <r|). 
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B. Results for Evaluation Scenario B 

For Scenario B, the average weighted sum rate (per channel 
use and over 750 channel realizations) is shown in Fig. [5] 
Once again, the proposed CVSINR strategy provides close- 
to-optimal performance. As in Scenario A, there is a clear 
gap to the distributed approaches, explained by fewer degrees 
of freedom in the interference cancelation. However, both 
DVSINR and coordinated ZF achieve the maximal multi- 
plexing gain in certain scenarios (see Theorem [4|, while 
the performance of single-cell processing is bounded at high 
output power. Observe that the major gain over single-cell 
processing comes from interference cancelation for terminals 
in Cj\ the difference between DVSINR and optimal joint 
transmission is comparably small (5 — 10 dB). 

Fig. [6] shows the average individual terminal rates for 
DVSINR (marked with triangles) and single-cell processing. 
Interestingly, the increased weighted sum rate with multicell 
coordination does not translate into a monotonic improvement 
of all terminal rates. Terminal 3 has almost equally strong 
channels from both base stations and therefore gain substan- 
tially from interference coordination. However, Terminal 2 has 
a very weak link to BS2 and sees a decrease in performance for 
output powers smaller than 10 dBm. This is explained by BSi 
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modifying its precoding to avoid interference at Terminals 3 
and 4. Thus, the common claim that multicell coordination 
improves both the total throughput and the fairness is not 
necessarily true in practice. However, Scenario A showed that 
as terminals move around in the whole area, they will on 
average benefit from multicell coordination. 

The analysis has thus far considered perfect base station 
synchronization, which cannot be guaranteed in practice due 
to CSI uncertainty, hardware delays, clock drifts, insufficient 
cyclic prefixes, and minor channel changes. We emulate 
these mismatches by letting the actual channels be h^" al = 
^■jkc e ihc for some random phase deviations <j)j}. c £ A/ (0, cr?) 



(where = means perfect synchronization). In Fig. [7 
the average weighted sum rate is shown as a function of the 
phase standard deviation (at 20 dBm output power). The 
optimal solution and the CVSINR strategy are very sensitive 
to synchronization errors as they rely on coherent interfer- 
ence cancelation where interfering signals from different base 
stations should cancel out perfectly. The DVSINR and coor- 
dinated ZF strategies are unaffected by such synchronization 
errors, and the gap to the optimal schemes based on incoherent 
interference reception reduces with a^. We conclude that 
very tight synchronization is required to gain from centralized 
multicell coordination with joint transmission. 

VII. Conclusion 

A general multicell OFDMA resource allocation framework 
was introduced with dynamic cooperation clusters that enables 
unified analysis of anything from interference channels to ideal 
network MIMO. Joint precoding and scheduling optimization 
was considered using arbitrary monotonic utility functions 
and linear power constraints. This problem is typically non- 
convex and NP-hard, but we proved three properties of the 
optimal solution: 1) Optimality of single-stream beamforming; 
2) Conditions for full power usage; and 3) A precoding 
parametrization based on K r K c + L real-valued parameters 
between zero and one. These properties greatly reduces the 
search space for optimal resource allocation. To illustrate their 
usefulness, we proposed the centralized CVSINR strategy and 
the distributed DVSINR strategy. Both exploited the three 
optimality properties in conjunction with efficient ProSched 
subcarrier scheduling. 

Contrary to previous work, the multicell performance was 
evaluated on measured channels in a typical urban macro- 
cell scenario. Substantial performance gains over single-cell 
processing were observed for both CVSINR and DVSINR. 
The former is even close-to-optimal, while the latter performs 
closely to what can be expected from distributed schemes 
(since coherent interference cancelation is more or less im- 
possible to achieve). This is remarkable since both CVSINR 
and DVSINR are just simple applications of the derived opti- 
mality properties — further parameter tweaking and adaptation 
to special scenarios are possible. The performance evaluation 
also showed that joint transmission is very sensitive to syn- 
chronization errors and that multicell coordinated improves the 
average terminal performance, but that terminals in some parts 
of the cells can experience performance degradations. 



Appendix A 
Proof of LemmaO 

Using Theorem [T] it is sufficient to consider solutions to 
( |P2| i of the form S kc = WfeW^. Without loss of generality, 
we can select the phase of w kc such that h^D/jW^c > 0. 
Under these conditions, the feasibility problem |P2} can be 
expressed as 

minimize 



subject to ££tr{Q,S fec } < q t V7, 

k=l c=l 
h fc C fc D X fc (l) w I fc (l)c 



(31) 



< 



Vfe, 



hf c C fe D Ifc (|i fc |)W Xfc( |x fc |) 

Cfcc 

Since the QoS constraints are second order cones, and the 
power constraints are semi-definite, this problem is convex 
and strong duality holds (35). To derive the Lagrange dual 
problem, we rewrite the Lagrangian of ( (31) similarly to 1 15 
Proposition 1]: 

k=l c=l 1=1 

1=1 kex k 



k=l c=l 



Ike 



D£h fcc h&D fc w fec . 



(32) 



Minimizing with respect to {\Vk c } u=i c=i 8 lVes a finite solu- 
tion only if 



(5>iQi+ £A Sc DfCfh Sc hf c C s D A 



i=i 



feeXfc 



(33) 



A, 



Ike 



-Dfh fcc hf c D fe ) ^0 Vfc,c, 



and the minimum is achieved by w kc = 0. Using 1 15 Lemma 



1], this dual feasibility constraint can be rewritten as 

Ike > A fcc h^D fc (£w»Q, + £ A £c Df Cf hfc C hf c C £ D fc 

1 kei k 

Ike 1 

A fce wf c Dfh fcc hf c D fc w fce 

^ X wf c Q>Q*+ ^ A Sc Df Cfh Sc hf c C s D fe )w, c 

; feel* 

(34) 

where the equality follows from introducing maximizatiorp] 
over an extra variable Wj. c £ C Wxl . Observe that this 

12 In general, (£. Qj +Efe e i fc HcPk C f HM^k^k) is rank- 
deficient, thus it might seem like there are solutions to j34| that achieve infinite 
utility. However, this is not the case as such solutions require h^,DfeW^ c ^ 
and from |33j it is clear that the denominator is non-zero for such solutions. 
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maximization corresponds to a linear MMSE receiver with 
optimum at 

l . 
wL = A te (5>«Qri-E A 5c Df Cf h Sc hf c C £ D fe ) Df h fcc . 

(35) 

By plugging Wfe c = into < [32] >, we achieve a Lagrange dual 
problem to ( (31) and ( |P2] i: 

maximize ^ ^ A fcc o| c - ^ (36) 
fe=1 c=1 J=1 

subject to Afe c > 0, > Vfc, c, I, 

max SINR™ L (w lc , . . . , w KrC , u, A c ) < j kc , Vfc, c. 

At the optimum, all SINR constraints are active (otherwise 
we could increase some Afc c and thereby increase the cost 
function), thus we replace them with equality constraints and 
achieve ( |D2[ ). 

Finally, to find a relationship between optimal \v kc and Wfc c 
we consider the stationary principle of the Lagrangian in ( (32] i: 

o dc 

dw kc 

L 

= 2(j>iQi + E A Sc Df Cf h^ c hf c CfeD fe (37) 

' =1 feel* 

- ^Df h fcc hf c D fc )w fec . 

7fcc ' 

By defining the scalar dk c — Afc c h^DfcWfe c /7fc c , we achieve 
w fec = 4c(E^Q'+E A Sc Df Cf h Sc hf c C s D fe ) Df h fcc 

i = 1 feGlfc 

(38) 

and identify Wfc c from ( [3~5) l (which is unique up to scaling). 
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